On efficient mutual nearest neighbor query processing in spatial databases

نویسندگان

  • Yunjun Gao
  • Baihua Zheng
  • Gencai Chen
  • Qing Li
چکیده

This paper studies a new form of nearest neighbor queries in spatial databases, namely, mutual nearest neighbor (MNN) search. Given a set D of objects and a query object q, an MNN query returns from D, the set of objects that are among the k 1 (P1) nearest neighbors (NNs) of q; meanwhile, have q as one of their k 2 (P1) NNs. Although MNN queries are useful in many applications involving decision making, data mining, and pattern recognition, it cannot be efficiently handled by existing spatial query processing approaches. In this paper, we present the first piece of work for tackling MNN queries efficiently. Our methods utilize a conventional data-partitioning index (e.g., R-tree, etc.) on the dataset, employ the state-of-the-art database techniques including best-first based k nearest neighbor (kNN) retrieval and reverse kNN search with TPL pruning, and make use of the advantages of batch processing and reusing technique. An extensive empirical study, based on experiments performed using both real and synthetic datasets, has been conducted to demonstrate the efficiency and effectiveness of our proposed algorithms under various experimental settings. This paper studies a new form of nearest neighbor (NN) queries, namely mutual nearest neighbor (MNN) search. Given a dataset D, a query point q, and two parameters k 1 and k 2 , an MNN query retrieves those objects p 2 D such that p 2 NN k1 (q) 1 and q 2 NN k2 (p), i.e., it requires each answer object 2 to be one of the k 1 nearest neighbors (NNs) to q and meanwhile has q as one of its k 2 NNs. Consequently, it considers not only the spatial proximity of the answer objects to q, but also the spatial proximity of q to the answer objects. In other words, the conventional NN query is asymmetric, while MNN retrieval is symmetric. Although it is well known that asymmetric NN search fits the requirements of lots of applications, there are still many other practical applications that require symmetric NN queries. Some real-life applications are presented as follows. Resource allocation. Consider that a logistic company A has six branches (labeled as p 1 , p 2 , p 3 , p 4 , p 5 , p 6), as shown in Fig. 1a. In order to guarantee the quality of service, company A assigns each branch two nearby branches as backup to provide necessary supports in …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient k Nearest Neighbor Queries on Remote Spatial Databases Using Range Estimation (Draft Version)

K-Nearest Neighbor (k-NN) queries are used in GIS and CAD/CAM applications to find the k spatial objects closest to some given query points. Most previous k-NN research has assumed that the spatial databases to be queried are local, and that the query processing algorithms have direct access to their spatial indices, e.g. R-trees. Clearly, this assumption does not hold when k-NN queries are dir...

متن کامل

A k-Nearest Neighbor Search Algorithm for Enhancing Data Privacy in Outsourced Spatial Databases

With the advancement of cloud computing technologies and the propagation of locationbased services, research on outsourced spatial databases has been spotlighted. Therefore, the traditional spatial databases owners want to outsource their resources to a service provider so that they can reduce cost for storage and management. However, the issue of privacy preservation is crucial in spatial data...

متن کامل

Indexing the Solution Space: A New Technique for Nearest Neighbor Search in High-Dimensional Space

ÐSimilarity search in multimedia databases requires an efficient support of nearest-neighbor search on a large set of highdimensional points as a basic operation for query processing. As recent theoretical results show, state of the art approaches to nearest-neighbor search are not efficient in higher dimensions. In our new approach, we therefore precompute the result of any nearest-neighbor se...

متن کامل

Efficient k Nearest Neighbor Queries on Remote Spatial Databases Using Range Estimation

K-Nearest Neighbor (k-NN) queries are used in GIS and CAD/CAM applications to find the k spatial objects closest to some given query points. Most previous k-NN research has assumed that the spatial databases to be queried are local, and that the query processing algorithms have direct access to their spatial indices; e.g., R-trees. Clearly, this assumption does not hold when k-NN queries are di...

متن کامل

Rapid AkNN Query Processing for Fast Classification of Multidimensional Data in the Cloud

A k-nearest neighbor (kNN) query determines the k nearest points, using distance metrics, from a specific location. An all k-nearest neighbor (AkNN) query constitutes a variation of a kNN query and retrieves the k nearest points for each point inside a database. Their main usage resonates in spatial databases and they consist the backbone of many location-based applications and not only (i.e. k...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Data Knowl. Eng.

دوره 68  شماره 

صفحات  -

تاریخ انتشار 2009